(54) Method and apparatus for manifesting characteristic existing in symbolic sequence

(57) A method is provided which manifests a characteristic that is latent and cannot be recognized, although it exists
in a complicated symbolic sequence, for example, a nucleotide sequence of DNA, and thereby enables recognition of a
characteristic not yet recognized.

When a symbolic sequence Ij (j = 1 to m) is given, there is effected a conversion to a parallel sequence A(k) of
partial symbolic sequences in which the suffix j is aligned in the following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

and A(k) is formed while changing k to p, p+r, p+2r, p+3r, ..., and the whole parallel sequences ΣA(k) is obtained.

When regularity of period length k exists in the symbolic sequence Ij, the regularity appears remarkably in the
partial symbolic sequences obtained by extracting one symbol at an interval of k-1 symbols (namely, at every k) from
the symbolic sequence.

Description

BACKGROUND OF THE INVENTION 
Field of the Invention

[0001] The present invention relates to a method and apparatus for manifesting a characteristic or regularity which is
latent and cannot be recognized, although such characteristic or regularity actually exists in a complicated symbolic
sequence, for example, a nucleotide sequence of DNA, an amino acid sequence of a protein, or a digit sequence of the
decimal expansion of an irrational number and the like. In these sequences, regularity cannot be recognized at a
glance even when it is present. The present invention enables recognition of a characteristic or regularity
included in the symbolic sequence but not yet recognized.

Description of the Related Art 


[0002] Some complicated symbolic sequences contain characteristics which have not been recognized by human
beings, although the characteristics actually exist. For example, genetic information is specified by a long symbolic
sequence. The sequence is built from four symbols, each indicating one of four kinds of nucleotides. A large
number of symbols are one-dimensionally aligned. In the study of genetic information, it is extremely important to
recognize a certain regularity hidden in the symbolic sequence indicating genetic information. Besides, if a certain
regularity is found in an irrational number such as the number π or the base of the natural logarithm (e), the study
of random numbers is advanced and various developments are expected in mathematics.

[0003] For such a purpose, various trials have been made to analyze a symbolic sequence based on a variety of
mathematical methods such as Fourier analysis. However, these trials have not necessarily produced successful
results. One problem with the conventional analysis methods is that even if a certain regularity is included in a part
of a very long symbolic sequence, the regularity existing partially is buried in the whole sequence and cannot be
recognized when the whole symbolic sequence is analyzed. Since there is no effective technology to know in advance in
which part of the sequence the regularity exists, there are many characteristics or regularities which cannot be
recognized by the conventional analysis methods.


SUMMARY OF THE INVENTION 

[0004] An object of the present invention is to provide a method and apparatus which manifests a characteristic or
regularity even if such characteristic or regularity exists only in a part of the whole symbolic sequence, and thereby
enables recognition of a characteristic or regularity which has not been recognized until now.

[0005] Another object of the present invention is to manifest a characteristic or regularity existing throughout the
entire sequence.

[0006] In one embodiment of the present invention, when a symbolic sequence Ij (j = 1 to m) is given, there is
effected a conversion to a parallel sequence A(k) of partial symbolic sequences in which the suffix j is aligned in the
following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

Instead, the positional relation may be the following:

j = 1, 2, ..., k-1, k
j = k+k, k+k-1, ..., k+2, k+1
...
j = (n-1)k+k, (n-1)k+k-1, ..., (n-1)k+2, (n-1)k+1
j = nk+1, nk+2, ..., nk+k-1, nk+k

Herein, k represents an integer of 2 or more, n represents an integer such that nk < m ≤ nk+k, and when the suffix j is
m+1 or more, the processing is ignored.
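
The two positional relations above can be sketched in code. The following is a minimal illustration only, not the implementation of the invention: the function name `parallel_sequence`, the `reciprocal` flag and the use of `None` for ignored positions (j of m+1 or more) are assumptions introduced here.

```python
def parallel_sequence(seq, k, reciprocal=False):
    """Arrange seq (I_1 .. I_m) into rows of k symbols.

    Row 0 holds j = 1..k, row 1 holds j = k+1..2k, and so on, so each
    column is one partial symbolic sequence (one symbol taken at every k).
    Positions with j > m are ignored and filled with None.
    """
    if k < 2:
        raise ValueError("k must be an integer of 2 or more")
    m = len(seq)
    rows = []
    for start in range(0, m, k):
        row = [seq[i] if i < m else None for i in range(start, start + k)]
        # Second positional relation: every other row runs right to left.
        if reciprocal and (start // k) % 2 == 1:
            row.reverse()
        rows.append(row)
    return rows


if __name__ == "__main__":
    for row in parallel_sequence("ABCDEFGH", 3):
        print(row)
```

With `reciprocal=True`, the second row of the example is reversed, which corresponds to the alternative positional relation.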

[0007] Then, the converted parallel sequence A(k) is output using one or more expression means selected from hue,
lightness and saturation of color and from interval, tone and volume of sound.

[0008] Equidistant Letter Sequences in the Book of Genesis (Doron Witztum, Eliyahu Rips and Yoav Rosenberg,
Statistical Science 1994, Vol. 9, No. 3, pages 429-438) introduces a technology in which a code hidden in a
one-dimensional letter sequence is decoded by converting the one-dimensional letter sequence to a parallel sequence
A(k) of partial symbolic sequences. This technology requires an operation to extract meaningful letters from the
parallel sequence A(k) of partial symbolic sequences, and it cannot be used for sequences other than letter sequences.
Further, when a certain regularity is to be recognized in a symbolic sequence which appears irregular at a glance, as
is often found in nature, that technology cannot resolve the inconsistency that regularity cannot be recognized unless
the regularity to be recognized has been recognized in advance.

[0009] In the present invention described above, since a parallel sequence A(k) of partial symbolic sequences is
output using one or more expression means selected from hue, lightness and saturation of color and from interval, tone
and volume of sound, even if regularity is not known in advance, that regularity is manifested by a pattern of hue,
lightness and saturation of color and interval, tone and volume of sound and can be easily recognized.

[0010] In another embodiment of the present invention, when a one-dimensional symbolic sequence Ij (j = 1 to m) is
given, there is effected a conversion to a parallel sequence A(k) of partial symbolic sequences in which the suffix j is
aligned in the following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

Instead, the positional relation may be the following:

j = 1, 2, ..., k-1, k
j = k+k, k+k-1, ..., k+2, k+1
...
j = (n-1)k+k, (n-1)k+k-1, ..., (n-1)k+2, (n-1)k+1
j = nk+1, nk+2, ..., nk+k-1, nk+k

Further, when p represents a natural number from 2 to less than m and r represents any natural number, the
above-described conversion is repeated while changing k to p, p+r, p+2r, p+3r, ..., to obtain parallel sequences of
partial symbolic sequences: A(p), A(p+r), A(p+2r), A(p+3r), .... Then, the resulting parallel sequences A(p), A(p+r),
A(p+2r), A(p+3r), ... are further parallel-positioned to make a whole parallel sequences ΣA(k). Then, the obtained
whole parallel sequences ΣA(k) is output. Here, n represents an integer such that nk < m ≤ nk+k, and when the suffix j
is m+1 or more, the processing is ignored.
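
The formation of the whole parallel sequences ΣA(k) may be sketched as follows. This is an illustrative sketch under stated assumptions: the names `parallel_sequence` and `whole_parallel_sequences` are introduced here, and the upper bound `k_max` is a hypothetical parameter, since the text does not state where the series p, p+r, p+2r, ... ends.

```python
def parallel_sequence(seq, k):
    """Rows of k consecutive symbols; positions past the end are None."""
    m = len(seq)
    return [[seq[i] if i < m else None for i in range(s, s + k)]
            for s in range(0, m, k)]


def whole_parallel_sequences(seq, p, r=1, k_max=None):
    """Return {k: A(k)} for k = p, p+r, p+2r, ... up to k_max.

    k_max is a hypothetical bound (default m-1) chosen for this sketch.
    """
    m = len(seq)
    if not 2 <= p < m:
        raise ValueError("p must be a natural number from 2 to less than m")
    if r < 1:
        raise ValueError("r must be a natural number")
    if k_max is None:
        k_max = m - 1
    return {k: parallel_sequence(seq, k) for k in range(p, k_max + 1, r)}


if __name__ == "__main__":
    for k, a_k in whole_parallel_sequences("ABCDEFGHIJ", p=2, r=2, k_max=6).items():
        print(k, a_k)
```

Placing the returned A(k) side by side, in order of increasing k, gives the arrangement that the text calls the whole parallel sequences.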

[0011] In this case, a parallel sequence made by parallel-positioning of p partial symbolic sequences, a parallel
sequence made by parallel-positioning of p+r partial symbolic sequences, and parallel sequences made by
parallel-positioning of a likewise increased number of partial symbolic sequences, are all parallel-positioned. In
this processing, if regularity of period length a is hidden in the symbolic sequence, such regularity is remarkably
manifested in a parallel sequence A(a) made by parallel-positioning of a partial symbolic sequences.

[0012] Even if a falls between the values p, p+r, p+2r, p+3r, ..., regularity of period length a is manifested in a
parallel sequence whose number of partial symbolic sequences is close to a. Therefore, the increment r in the number of
the partial symbolic sequences is not necessarily required to be one, and it may advantageously be any natural number.
In this case, the smaller the increment r, the more reliably the characteristic is manifested.
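
The statement that regularity of period length a is most remarkable in A(a) can be checked numerically on a synthetic sequence. The score below is an assumption made for illustration only; the invention itself relies on recognition by color or sound rather than on any such score. `column_uniformity` is a hypothetical name.

```python
from collections import Counter


def column_uniformity(seq, k):
    """Mean share of the most frequent symbol in each column of A(k).

    Column c of A(k) is the partial symbolic sequence seq[c::k].
    A value of 1.0 means every column is a single repeated symbol,
    that is, the pattern is made of pure vertical stripes.
    """
    scores = []
    for c in range(k):
        column = seq[c::k]
        if column:
            top_count = Counter(column).most_common(1)[0][1]
            scores.append(top_count / len(column))
    return sum(scores) / len(scores)


if __name__ == "__main__":
    seq = "ATGCCA" * 20  # synthetic sequence with period length 6
    for k in range(2, 10):
        print(k, round(column_uniformity(seq, k), 3))
```

For this synthetic input the score reaches 1.0 at k=6 and stays lower for the other k between 2 and 9.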

[0013] In this embodiment, regularity of an unknown period length is manifested in a parallel sequence of some number
of partial symbolic sequences, and recognition of the characteristic becomes easy.

[0014] In the above-described method, it is preferable that each symbol is expressed by a combination of hue, lightness
and saturation of color. In this embodiment, since the characteristic hidden in the symbolic sequence is manifested
visually, the characteristic is understood more fully, and various applications and developments utilizing the
characteristic are made possible. Further, the resulting visual pattern mixes regularity and irregularity in a way
which has not conventionally existed, so that a visual pattern whose design itself has utility can be designed.

[0015] Each symbol may be expressed by a combination of interval, tone and volume of sound. By expressing the
parallel sequence by a combination of interval, tone and volume of sound, a unique audio pattern is created and the
characteristic of the symbolic sequence can be recognized through the auditory sense.

[0016] When one symbol is taken out from an original symbolic sequence at an interval of k-1 (namely, at every k) to
make a symbolic sequence and the method is applied to this extracted symbolic sequence, if regularity of period length
k is hidden in the original symbolic sequence, the regularity is manifested and appears remarkably.

[0017] When one symbol is taken out from an original symbolic sequence at an interval of kq-1 (namely, at every kq)
to make a symbolic sequence and the method is applied to this extracted symbolic sequence, if regularity of period
length kq is hidden in the original symbolic sequence, the regularity is manifested and appears remarkably.

[0018] Further, when any of the above-described methods is conducted while changing k to p, p+r, p+2r, ..., there
is formed a whole parallel sequences ΣA(k) consisting of a parallel sequence A(p) made by parallel-positioning of p
partial symbolic sequences, a parallel sequence A(p+r) made by parallel-positioning of p+r partial symbolic sequences,
and parallel sequences made by parallel-positioning of a likewise increased number of partial symbolic sequences, and
regularity of period length a appears remarkably in a parallel sequence A(a) formed by parallel-positioning of k (=a)
partial symbolic sequences. Therefore, a characteristic or regularity of unknown period length is manifested, and
recognition of the characteristic or regularity becomes easy.

[0019] According to this method, even if the period length a of the regularity or characteristic falls between the
values p, p+r, p+2r, ..., the characteristic or regularity is manifested in a parallel sequence formed by
parallel-positioning a number of partial sequences close to a, and the increment r is not necessarily required to be
one. It becomes possible to manifest a characteristic with a small amount of data processing by selecting an increment
r suited to the case at hand.

[0020] Further, by outputting analyzed results by color and/or sound, expressions suitable to the event and the
observer become possible, and the characteristic is more easily recognizable. The resulting color and/or sound pattern
will be an interesting pattern in which regularity and irregularity are mixed, and the method can also be utilized as
a designing method.

[0021] Especially when the initiation position of regularity coincides with the analysis initiation position, a
pattern having a parabolic shape clearly appears in the whole parallel sequences ΣA(k), and regularity of a long
period length is manifested clearly.

[0022] The present invention will be understood more fully by reading the following descriptions of examples with
reference to the drawings.

BRIEF EXPLANATION OF THE DRAWINGS 

[0023]

Fig. 1 represents the whole parallel sequences ΣA(k) formed from a nucleotide sequence of human genomic DNA.
Fig. 2 represents the positional relation of the suffix j in Fig. 1.
Fig. 3 represents the whole parallel sequences ΣA(k) formed from a numerical sequence showing π.
Fig. 4 represents the whole parallel sequences ΣA(k) formed from a circulating numerical sequence of period
length 18 (symbolic sequence).
Fig. 5 represents the whole parallel sequences ΣA(k) formed from a circulating numerical sequence of period
length 12 (symbolic sequence).
Fig. 6 represents a part of the whole parallel sequences ΣA(k) formed from an amino acid sequence of the muscle
protein myosin.
Fig. 7 represents another part of the whole parallel sequences ΣA(k) formed from the amino acid sequence of the
muscle protein myosin.
Fig. 8 represents still another part of the whole parallel sequences ΣA(k) formed from the amino acid sequence of
the muscle protein myosin.
Fig. 9 represents the positions at which the vowel 'O' appears in 'Genji Monogatari'.
Fig. 10 represents placing positions in converting a symbolic sequence of 100 symbols into the whole parallel
sequences ΣA(k).
Fig. 11 explains pre-treatments for symbolic sequences.
Fig. 12 represents another example of placing positions in converting a symbolic sequence into the whole parallel
sequences ΣA(k).
Fig. 13 represents the whole parallel sequences ΣA(k) formed from a cDNA sequence of a G protein β subunit.
Fig. 14 represents extraction, from a symbolic sequence M, of the symbolic sequence I to be processed while
changing the initial point.
Fig. 15 represents an example in which a parabolic pattern appears in the whole parallel sequences ΣA(k) formed
from a genomic DNA sequence of baker's yeast.
Fig. 16 represents another positioning (reciprocal) pattern for obtaining a parallel sequence A(k).
Fig. 17 represents the whole parallel sequences ΣA(k) formed from a circulating sequence of period length 100.
Fig. 18 represents the constitution of an apparatus for effecting the method of the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 


[0024] Experimental examples embodying the present invention will be introduced below. 
[First experimental example] 

[0025] Fig. 1 represents an experimental example of processing a symbolic sequence Ij showing a nucleotide
sequence of human genomic DNA. A symbolic sequence Ij indicating a nucleotide sequence of human genomic DNA is
formed by a one-dimensional sequence of an enormous number of symbols, each symbol indicating one of the four kinds of
nucleotides A, T, G and C, and a certain regularity hidden therein is regarded as useful information. Therefore, it is
a major object in genetic study to find regularity, or to specify a part of the sequence including the regularity.

[0026] Fig. 1 represents a processed result output through color, and the four kinds of symbols A, T, G and C are,
respectively, expressed by the four colors red, blue, green and yellow. The original image of Fig. 1 is expressed in
four colors. Fig. 1 represents a result when the present invention is conducted with p=5 and r=1.
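
The color output of this example may be sketched as follows. Only the assignment of A, T, G and C to red, blue, green and yellow comes from the text; the RGB values, the white filler for ignored positions, the function names and the choice of the plain-text PPM (P3) image format are assumptions made for illustration.

```python
# Assumed RGB values; the text only names the four colors.
COLORS = {
    "A": (255, 0, 0),    # red
    "T": (0, 0, 255),    # blue
    "G": (0, 255, 0),    # green
    "C": (255, 255, 0),  # yellow
}
BLANK = (255, 255, 255)  # assumed filler for positions j > m


def color_grid(seq, k):
    """Return A(k) as rows of RGB tuples, k symbols per row."""
    m = len(seq)
    return [[COLORS[seq[i]] if i < m else BLANK for i in range(s, s + k)]
            for s in range(0, m, k)]


def to_ppm(grid):
    """Serialize a non-empty grid as a plain-text PPM (P3) image."""
    height, width = len(grid), len(grid[0])
    lines = ["P3", f"{width} {height}", "255"]
    for row in grid:
        lines.append(" ".join(f"{r} {g} {b}" for r, g, b in row))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_ppm(color_grid("ATGCATGCATGCATGCAT", 4)))
```

Writing the returned text to a file with the extension .ppm gives an image in which a repeat of period length k shows up as vertical stripes.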

[0027] Referring to an example of a parallel sequence A(17) in which k=17, as shown in Fig. 2, longitudinal partial
symbolic sequences C1, C2, C3, ..., C17, which are extracted from a symbolic sequence Ij at every k and aligned
longitudinally, are laterally aligned to form a parallel sequence A(17). In C1, C2, C3, ..., the values of the
suffixes j of the symbols to be extracted are shifted by one. This rule is common to all k values and to all partial
symbolic sequences C.

[0028] In this example, a symbol group extracted at every k is placed longitudinally to form a longitudinal partial
symbolic sequence, and the longitudinal partial symbolic sequences are laterally placed. However, the longitudinal and
lateral relations may be reversed: a symbol group extracted at every k may be placed laterally to form a lateral
partial symbolic sequence, and the lateral partial symbolic sequences may be longitudinally placed.

[0029] In Fig. 1, B16 remarkably shows that a repeating pattern of period length 16 exists in a part of the nucleotide
sequence. From the pattern B16, it can be learned that useful information may be included in this part, and that this
part is an area worth analyzing in detail. B17 and B16 represent the same regularity. The regularity of period length
16 appears as vertical stripes in B16, and appears as inclined stripes in B17. The inclined stripes in B17 have a
pattern in which the left side is lowered. B18 also represents the same regularity, and the inclination of the stripes
in B18 is closer to horizontal than in B17. The same regularity is also shown in a parallel sequence A(19) in which
k=19; however, in this case, the inclination is almost horizontal, and extraction of the characteristic becomes
increasingly difficult.

[0030] Regularity of period length a appears vertically and is expressed most remarkably in A(a), in which k (=a)
partial symbolic sequences are parallel-positioned. However, the regularity also appears in parallel sequences of
partial symbolic sequences in which k=a+1 and k=a+2. Therefore, it is confirmed that the increment r is not necessarily
required to be 1.
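
The change in inclination of the stripes can be followed by computing where successive occurrences of a repeated symbol fall in A(k). The sketch below is illustrative only; `position` is a hypothetical helper, and the occurrences j = 9, 25, 41, 57 are an arbitrary example of a symbol repeating with period length 16.

```python
def position(j, k):
    """Return (row, column), both 0-based, of suffix j (1-based) in A(k)."""
    return divmod(j - 1, k)


if __name__ == "__main__":
    occurrences = [9, 25, 41, 57]  # one symbol recurring with period length 16
    for k in (16, 17, 18):
        print(k, [position(j, k) for j in occurrences])
```

For k=16 the column stays fixed (vertical stripe). For k=17 the column moves one place to the left per row, and for k=18 two places per row, so the stripe slopes down to the left and flattens as k moves away from the period length.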

[0031] A18 shows regularity of period length 18, and the same regularity is shown as pattern A17, in which the right
side is lowered, in the parallel sequence A(17) in which k=17, and as pattern A19, in which the left side is lowered,
in the parallel sequence A(19) in which k=19.

[0032] In addition, many remarkable patterns appear in Fig. 1, and characteristics hidden in a nucleotide sequence
of human genomic DNA are grasped from these patterns.

[0033] The initial number p in the whole parallel sequences of partial symbolic sequences may be any natural number,
and in Fig. 1, p=5. The increment r is not limited to 1, and it may be 2 or more. When r is smaller, characteristics
are less likely to be missed, and when r is larger, the amount of data processing is smaller. The increment r is not
required to be constant, and it is preferable to select the increment r according to the case at hand.

[0034] Fig. 18 represents an apparatus for effecting the above-described processing method. In this apparatus,
a symbolic sequence Ij to be analyzed is stored in a memory apparatus 181, an apparatus 182 converts the
symbolic sequence Ij into a parallel sequence A(k), an apparatus 183 forms the whole parallel sequences ΣA(k) in
which a plurality of parallel sequences A(k) obtained by changing the value of k are parallel-positioned, and an
apparatus 184 outputs the whole parallel sequences ΣA(k). The apparatus 182 and the apparatus 183 may be constituted
of a computer, and the apparatus 184 may be constituted of a color printer. When the whole parallel sequences ΣA(k) is
output using sound, a sound synthesizer may be used as the apparatus 184.

[0035] It is preferable that Fig. 1 is expressed with a time lapse according to the speed of processing the symbolic
sequence. For example, in Fig. 1, the color corresponding to I1 is first expressed at the upper-left corners of A(5)
to A(21), and I2, I3, I4, ... are then expressed in succession. By using this change over time, characteristics are
more easily recognized. Output with a time lapse is also effective in the case of output by sound: when output with a
time lapse is conducted, characteristics are recognized through changes in sound.

[0036] Fig. 3 exemplifies a result obtained by processing the symbolic sequence (numerical sequence) of π, in which
the 10 kinds of symbols (numbers) 0 to 9 are expressed by 10 colors dividing the spectrum equally from violet to red.
It is found from the expression result of Fig. 3 that specific symbols (numbers) tend to appear frequently in a
specific range.
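
A color assignment of this kind may be sketched with the standard `colorsys` module. The hue range is an assumption made here: hue 0.75 is taken as violet and hue 0.0 as red, with full saturation and lightness, since the text gives no numerical color values. `digit_color` and `digit_grid` are hypothetical names.

```python
import colorsys


def digit_color(d):
    """Map digit 0..9 to one of 10 equally spaced hues, violet (0) to red (9)."""
    if not 0 <= d <= 9:
        raise ValueError("d must be a digit from 0 to 9")
    hue = 0.75 * (9 - d) / 9  # assumed range: 0.75 (violet) down to 0.0 (red)
    r, g, b = colorsys.hsv_to_rgb(hue, 1.0, 1.0)
    return (round(r * 255), round(g * 255), round(b * 255))


def digit_grid(digits, k):
    """Return A(k) for a string of digits as rows of RGB tuples."""
    return [[digit_color(int(ch)) for ch in digits[s:s + k]]
            for s in range(0, len(digits), k)]


if __name__ == "__main__":
    for row in digit_grid("3141592653", 5):  # first ten digits of pi
        print(row)
```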

[0037] When a noise input is treated as a symbolic sequence and this symbolic sequence is processed to
obtain a pattern similar to that in Fig. 3, it becomes possible to extract a characteristic existing in the noise and
to extract only meaningful sound included in the noise. Further, the pattern shown in Fig. 3 can be used, for example,
as a ground pattern for securities, and this complicated ground pattern can be specified by a one-dimensional symbolic
sequence.

[0038] Fig. 4 represents a result obtained by processing a circulating numerical sequence of period length 18, and
various patterns can be drawn according to the number k of partial symbolic sequences into which the sequence is
divided. Various textile patterns can be designed by this pattern creating technology. Fig. 5 represents a processed
result of a circulating numerical sequence of period length 12, and it is confirmed that patterns different from those
of Fig. 4 can be made. According to this method, the complicated pattern shown in Fig. 3 and the regular patterns
shown in Figs. 4 and 5 can be designed by the same method. Further, various patterns having utterly different
impressions can be produced by changing the correspondence between symbols and colors.

[0039] Figs. 6 through 8 represent a result obtained by processing a symbolic sequence which shows an amino acid
sequence of the protein myosin of the adductor muscle of a scallop. In Figs. 6 to 8, basic residues are shown in blue,
polar residues in green, acidic residues in red, and hydrophobic residues in yellow. In Fig. 6, a remarkable yellow
longitudinal stripe appears in a parallel sequence in which k=7, and the existence of regularity of period length 7 is
found. This regularity of hydrophobic residues with period length 7 corresponds to an α-helix, and by this method, the
existence of the α-helix can be recognized and its position can be specified. This α-helix is manifested as yellow
longitudinal stripes in parallel sequences in which k=7, 14, 28 and 35, and as yellow diagonal lines in parallel
sequences in which, for example, k=22, 27 and 29.

[0040] Fig. 9 represents an example expressing dots in positions where the vowel 'O' appears, prepared by applying the
present invention to a symbolic sequence showing the row of vowels in 'Genji Monogatari'. The left side represents an
analysis result of the Kiritsubo chapter, and the right side represents an analysis result of the Hahakigi chapter.
The result manifests the characteristic that the vowel 'O' appears with high frequency in specific ranges of the
document and with low frequency in other specific ranges. By this method, extraction of a characteristic from letter
information becomes easy.

[0041] Fig. 10 schematically represents the processing contents for a symbolic sequence Ij (j = 1 to 100) to be
processed.

[0042] When the period length of the regularity to be extracted is known in advance, it is easily recognized whether
the regularity of the known period length k really exists and, if it exists, where it exists, by forming a parallel
sequence A(k) in which partial symbolic sequences obtained by division into k fractions are parallel-positioned.

[0043] Even when the period length is not known, regularity of the unknown period length is manifested at some
location in the whole parallel sequences.

[0044] Fig. 11 represents an example of pre-processing for a symbolic sequence to be processed. When part of
a symbolic sequence J shown in (A) is processed, the part to be processed as shown in (B) becomes the whole symbolic
sequence I of the present invention. Further, when one symbol is specified by a combination of a plurality of symbols,
this method is applied to the symbolic sequence specified by the combinations of a plurality of symbols, for example,
as shown in (C). Alternatively, as in the calculation of a moving average, one symbol may be obtained from the symbols
of orders 1, 2 and 3 in a symbolic sequence K, then one symbol obtained from the symbols of orders 2, 3 and 4 in the
symbolic sequence K, and this procedure repeated to effect conversion into one symbolic sequence I, which is then
processed by the method. Further, as shown in (E), for a symbolic sequence existing within a symbolic sequence at a
specific period, a symbolic sequence of this period may first be extracted, and the present invention applied to the
extracted symbolic sequence.
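
Two of these pre-treatments may be sketched as follows: taking a part of a sequence, and combining overlapping runs of symbols in the manner of a moving average. The function names, the default window width of 3 and the choice of joining the symbols into one string are assumptions made for illustration; the text does not state how the combined symbol is formed.

```python
def extract_part(seq, first, last):
    """Keep the symbols whose 1-based orders run from first to last."""
    return seq[first - 1:last]


def sliding_combination(seq, width=3, combine="".join):
    """Combine orders 1..width, then 2..width+1, and so on, into new symbols.

    combine is an assumed rule; here the symbols are simply joined.
    """
    return [combine(seq[i:i + width]) for i in range(len(seq) - width + 1)]


if __name__ == "__main__":
    K = "ATGCATTG"
    I = sliding_combination(extract_part(K, 2, 7))
    print(I)
```

The resulting list is itself a symbolic sequence, with each combined symbol treated as one symbol, and can be converted to A(k) in the same way as any other.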

[0045] Instead of this method, processing as exemplified in Fig. 12 may be effected. In this method, one symbol is
extracted at every kq for a partial symbolic sequence in the longitudinal direction. In the case shown in this drawing,
the result is obtained by effecting the method while changing k to 2, 3, 4, ... with q fixed at 5. The result
corresponds to the result obtained when the symbols of orders 5, 10, 15, ... are first extracted, the extracted
sequence is then separated into k partial symbolic sequences, and the resulting partial sequences are
parallel-positioned to obtain a parallel sequence. By this method, it is possible to manifest regularity hidden in a
symbolic sequence which is itself hidden in a symbolic sequence L (shown in (E) of Fig. 11).
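
The equivalence described here, in which the symbols of orders q, 2q, 3q, ... are first extracted and the extracted sequence is then converted to A(k), may be sketched as follows. The names `take_every` and `parallel_sequence` are hypothetical, and the integers 1 to 30 stand in for the symbols so that the orders are visible in the output.

```python
def take_every(seq, q):
    """Return the symbols of 1-based order q, 2q, 3q, ..."""
    return seq[q - 1::q]


def parallel_sequence(seq, k):
    """Rows of k consecutive symbols; positions past the end are None."""
    m = len(seq)
    return [[seq[i] if i < m else None for i in range(s, s + k)]
            for s in range(0, m, k)]


if __name__ == "__main__":
    L = list(range(1, 31))     # stand-in symbols labelled by their order
    extracted = take_every(L, 5)
    for k in (2, 3, 4):        # k changes while q stays fixed at 5
        print(k, parallel_sequence(extracted, k))
```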

[0046] When a parallel sequence of partial symbolic sequences is obtained as described above, various methods
can be adopted for expressing the result: a method in which a symbol is expressed by color, a method in which a
symbol is expressed by variation in color density, and a method in which a symbol is expressed by a character
(two-dimensional pattern) may be adopted, and further, the resulting lines and rows of symbols may also be expressed
by sound. In this case, a chord is made from the arrangement of symbols in the line direction, and the arrangement in
the row direction is expressed by changing this chord over time. By this procedure, it becomes possible to grasp a
characteristic existing in a symbolic sequence through sound.

[0047] The present invention is useful for analyzing various symbolic sequences, such as a nucleotide
sequence of DNA, a nucleotide sequence of RNA, an amino acid sequence of a protein, a numerical sequence, a letter
sequence, a sound sequence and the like. By this analysis, it becomes possible to specify the position of useful
information and to extract useful information. Further, when this method is applied to two symbolic sequences which
cannot be distinguished at a glance, characteristics are manifested, and their identity can be easily judged. In this
sense, the characteristics and regularity manifested by this method are not restricted to a repeating pattern having a
certain period, and characteristics found in the distribution of appearing symbols are also manifested. Further, the
increment r in the number of partial symbolic sequences is not necessarily required to be 1, and it need not be a
constant number. By effecting this method with irregularly distributed values k1, k2, k3, ..., characteristics
existing in two or more symbolic sequences are manifested, and their identity is easily judged.

[0048] Fig. 13 represents the analysis result of a cDNA sequence of a G protein β subunit, and represents the result
when the whole parallel sequences ΣA(k) is obtained with p=5. In the original image of Fig. 13, G, C, T and A are
expressed by four colors, and three clearly different color zones are recognized.

[0049] The boundary 101 of the color zones corresponds approximately to the position of j=281, and the boundary
102 of the color zones corresponds approximately to the position of j=1303. In this case, it is known that a coding
range exists in the range from j=281 to j=1303, and it is recognized that the coding range is easily specified
visually by this method.

[0050] Fig. 14 represents a procedure to obtain the symbolic sequence I to be processed, while changing the initial
point, from a one-dimensional symbolic sequence M. For example, the symbolic sequence I6 to be processed is a symbolic
sequence obtained by extraction of M(6) and the following symbols.

[0051] When the present invention is performed on the symbolic sequences I1, I2, I3, I4, ... thus extracted to obtain
the whole parallel sequences ΣA(k), a clear pattern may be obtained in the whole parallel sequences ΣA(k)
corresponding to a specific I.
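
The extraction of I1, I2, I3, ... from M with a shifting initial point may be sketched as follows. The fixed length m of each extracted sequence follows the wording of the claims (u > m); the generator name `shifted_sequences` is an assumption introduced for illustration.

```python
def shifted_sequences(M, m):
    """Yield (s, I_s), where I_s is the run of m symbols of M starting at M(s).

    s is 1-based; only starting points that leave m symbols are used.
    """
    u = len(M)
    for s in range(1, u - m + 2):
        yield s, M[s - 1:s - 1 + m]


if __name__ == "__main__":
    for s, I in shifted_sequences("ATGCATGCAT", 6):
        print(s, I)
```

Each yielded sequence can then be converted to the whole parallel sequences, and the results compared across starting points s.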

[0052] Fig. 15 represents one example thereof, and in the original image, which is expressed in multicolor, a
plurality of parabolic lines 151, 152, 153, ... appear.

[0053] As a result of intensive study of this phenomenon, it has been recognized that the above-described line group
appears when the initiation point of regularity coincides with the initiation point of the symbolic sequence to be
processed. From this, it has been found that the initiation point of regularity can be specified by utilizing the
appearance of a line group.

[0054] Further, it has also been found that the gap between the group of lines 151, 152, 153, ... and another group
of lines 161, 162, 163, ... corresponds to regularity of an extremely long period, and that regularity of an extremely
long period can be recognized by utilizing a line group.

[0055] It has been recognized that the above-described pattern also appears when the sequential direction of the
partial symbolic sequences is alternately reversed in the lateral direction (reciprocal positioning pattern). Fig. 16
represents a positional relation for obtaining a parallel sequence A(k) by alternately reversing the sequential
direction of the partial symbolic sequences in the lateral direction. Fig. 17 represents an example in which a
circulating sequence having a period of 100 is converted to the whole parallel sequences ΣA(k) having the positional
relation shown in Fig. 16, and it is recognized that a clear line group appears.

[0056] The above-described explanations are only some specific examples and the present invention can be used in 
various ways within the attached claims. 

Claims 


1. A method for manifesting characteristic existing in a symbolic sequence Ij (j = 1 to m), comprising the steps of:

effecting conversion to a parallel sequence A(k) of partial symbolic sequences in which the suffix j is aligned in
the following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

or

j = 1, 2, ..., k-1, k
j = k+k, k+k-1, ..., k+2, k+1
...
j = (n-1)k+k, (n-1)k+k-1, ..., (n-1)k+2, (n-1)k+1
j = nk+1, nk+2, ..., nk+k-1, nk+k;

and

outputting the converted parallel sequence A(k) using one or more expression means selected from hue,
lightness and saturation of color and from interval, tone and volume of sound;

wherein k represents an integer of 2 or more, and n represents an integer such that nk < m ≤ nk+k, and when
the suffix j is m+1 or more, the processing is ignored.

2. A method for manifesting characteristic existing in a symbolic sequence Ij (j = 1 ~ m), comprising the steps of:

effecting conversion to a parallel sequence A(k) of partial symbolic sequences in which the suffix j is aligned in
the following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

or

j = 1, 2, ..., k-1, k
j = k+k, k+k-1, ..., k+2, k+1
...
j = (n-1)k+k, (n-1)k+k-1, ..., (n-1)k+2, (n-1)k+1
j = nk+1, nk+2, ..., nk+k-1, nk+k;

making a whole parallel sequences ΣA(k) by further parallel-positioning of parallel sequences A(p), A(p+r),
A(p+2r), A(p+3r), ... converted by changing k to p, p+r, p+2r, p+3r, ..., wherein p represents a natural
number from 2 to less than m, and r represents any natural number; and

outputting the obtained whole parallel sequences ΣA(k), wherein n represents an integer such that
nk < m ≤ nk+k, and when the suffix j is m+1 or more, the processing is ignored.
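
The construction of the whole parallel sequences from A(p), A(p+r), A(p+2r), ... can be sketched as follows. This is a minimal sketch under stated assumptions: the names parallel_sequence, whole_parallel_sequences and constant_columns are hypothetical, the series of k values is stopped below m because the text leaves the upper end open, and counting columns that hold a single repeated symbol is only one simple stand-in for the visual recognition of a pattern.

```python
def parallel_sequence(seq, k):
    """Rows of k consecutive symbols; the last row may be shorter."""
    return [seq[i:i + k] for i in range(0, len(seq), k)]


def whole_parallel_sequences(seq, p, r):
    """Return {k: A(k)} for k = p, p+r, p+2r, ... below m = len(seq)."""
    if p < 2 or r < 1:
        raise ValueError("p must be at least 2 and r at least 1")
    return {k: parallel_sequence(seq, k) for k in range(p, len(seq), r)}


def constant_columns(rows):
    """Count columns with at least two entries that all hold the same symbol."""
    count = 0
    for c in range(len(rows[0])):
        column = [row[c] for row in rows if c < len(row)]
        if len(column) >= 2 and len(set(column)) == 1:
            count += 1
    return count


sequence = "ABC" * 4                      # a sequence with period 3, m = 12
whole = whole_parallel_sequences(sequence, p=2, r=1)
aligned = [k for k, rows in whole.items() if constant_columns(rows) > 0]
```

For this example, aligned contains only the multiples of the period (3, 6 and 9), since only those values of k place equal symbols in the same column.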

3. The method according to Claim 2, wherein the whole parallel sequences ΣA(k) is output by using one or more
expression means selected from hue, lightness and saturation of color and from interval, tone and volume of sound.

4. The method according to Claim 2, further comprising the steps of:

making the symbolic sequence Ij by extracting symbols sequentially from a symbolic sequence
Ms (s = 1 to u, and u > m);

making the whole parallel sequences ΣA(k) from the extracted symbolic sequence Ij; and

repeating said two steps while shifting the initiation point at which the symbolic sequence Ij is extracted from the
symbolic sequence Ms.
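
The repetition over shifted initiation points can be sketched as follows. The names shifted_extracts and whole_parallel_sequences, the shift parameter, and the choice to stop the series of k values below m are assumptions made for this sketch; the text does not state how far the initiation point moves at each repetition.

```python
def shifted_extracts(source, m, shift=1):
    """Yield (start, I) pairs, where I is m consecutive symbols of source."""
    if len(source) <= m:
        raise ValueError("the source must be longer than m (u > m)")
    for start in range(0, len(source) - m + 1, shift):
        yield start, source[start:start + m]


def whole_parallel_sequences(seq, p, r):
    """A(k) for k = p, p+r, ... below len(seq), each as rows of k symbols."""
    return {k: [seq[i:i + k] for i in range(0, len(seq), k)]
            for k in range(p, len(seq), r)}


source = "ABCDEF"                         # stands in for the longer sequence
results = {start: whole_parallel_sequences(extract, p=2, r=1)
           for start, extract in shifted_extracts(source, m=4)}
```

Each entry of results holds the whole parallel sequences for one initiation point, so the outputs for successive starting positions can be compared.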

5. The method according to Claim 2, further comprising the step of:

making the symbolic sequence Ij by taking out m symbols sequentially at every q from a symbolic sequence
Ls (s = 1 to t, and t > mq).

6. The method according to Claim 1, further comprising the step of:

making the symbolic sequence Ij by taking out m symbols sequentially at every q from a symbolic sequence
Ls (s = 1 to t, and t > mq).
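
Taking out m symbols at every q can be sketched as follows. The function name take_every_q is hypothetical, and the sketch assumes that extraction starts at the first symbol of the source, which the text does not specify.

```python
def take_every_q(source, m, q):
    """Return m symbols of source taken at intervals of q (hypothetical helper).

    Assumes the extraction starts at the first symbol; the starting
    position is not fixed by the text.
    """
    if m < 1 or q < 1:
        raise ValueError("m and q must be positive integers")
    if len(source) <= m * q:
        raise ValueError("the source length t must satisfy t > m*q")
    return source[::q][:m]


extract = take_every_q("ABCDEFGHIJKLM", m=4, q=3)
```

Here the source has t = 13 symbols, which satisfies t > mq = 12, and the extract consists of the symbols at positions 1, 4, 7 and 10.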

7. An apparatus for manifesting characteristic existing in a symbolic sequence Ij (j = 1 ~ m), comprising:

a means for memorizing the symbolic sequence Ij;

a means for effecting conversion to a parallel sequence A(k) of partial symbolic sequences in which the suffix
j is aligned in the following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

or

j = 1, 2, ..., k-1, k
j = k+k, k+k-1, ..., k+2, k+1
...
j = (n-1)k+k, (n-1)k+k-1, ..., (n-1)k+2, (n-1)k+1
j = nk+1, nk+2, ..., nk+k-1, nk+k;

a means for making a whole parallel sequences ΣA(k) by further parallel-positioning of parallel sequences
A(p), A(p+r), A(p+2r), A(p+3r), ... converted by changing k to p, p+r, p+2r, p+3r, ..., wherein p represents a
natural number from 2 to less than m, and r represents any natural number; and

a means for outputting the obtained whole parallel sequences ΣA(k), wherein n represents an integer such that
nk < m ≤ nk+k, and when the suffix j is m+1 or more, the processing is ignored.

8. An apparatus for manifesting characteristic existing in a symbolic sequence Ij (j = 1 ~ m), comprising:

a means for memorizing the symbolic sequence Ij;

a means for effecting conversion to a parallel sequence A(k) of partial symbolic sequences in which the suffix
j is aligned in the following positional relation:

j = 1, 2, ..., k-1, k
j = k+1, k+2, ..., k+k-1, k+k
...
j = (n-1)k+1, (n-1)k+2, ..., (n-1)k+k-1, (n-1)k+k
j = nk+1, nk+2, ..., nk+k-1, nk+k

or

j = 1, 2, ..., k-1, k
j = k+k, k+k-1, ..., k+2, k+1
...
j = (n-1)k+k, (n-1)k+k-1, ..., (n-1)k+2, (n-1)k+1
j = nk+1, nk+2, ..., nk+k-1, nk+k;

and

a means for outputting the converted parallel sequence A(k) using one or more expression means selected
from hue, lightness and saturation of color and from interval, tone and volume of sound;

wherein k represents an integer of 2 or more, and n represents an integer such that nk < m ≤ nk+k, and when
the suffix j is m+1 or more, the processing is ignored.
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